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Claims 



1 1 . A method for detecting moving video objects in a compressed digital 

2 bitstream which represents a sequence of fields or frames of video 

3 information for one or more previously captured scenes of video, 

4 comprising the steps of: 

5 a. analyzing said compressed bitstream to locate scene cuts therein, 

6 thereby determining at least one sequence of fields or frames of 

7 video information which represents a single video scene; 

8 b. estimating one or more operating parameters for a camera which 

9 initially captured said video scene by analyzing a portion of said 

10 compressed bitstream which corresponds to said video scene; and 

1 1 c. detecting one or more moving video objects represented in said 

12 compressed bitstream by applying global motion compensation with 

1 3 said estimated operating parameters . 

1 2. The method of claim 1, further comprising the step of extracting visual 

2 features of said one or more detected moving video objects from said 

3 compressed bitstream. 

1 3. The method of claim 1 , wherein said compressed bitstream comprises a 

2 bitstream compressed in accordance with the MPEG video standard. 

1 4. The method of claim 1, wherein said analyzing step further comprises the 

2 steps of: 

3 a. parsing said compressed bitstream into blocks of video information 

4 and associated motion vector information for each field or frame of 

5 video information which comprises the determined sequence of 

6 fields or frames of video information representative of said single 

7 scene; 
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8 b. performing inverse motion compensation on each of said parsed 

9 blocks of video information to derive discrete cosign transform 

10 coefficients for each of said parsed blocks of video information; 

11 c. counting said motion vector information associated with each of 

1 2 said parsed blocks of video information; and 

1 3 d. Determining from said counted motion vector information and said 

14 discrete cosign transform coefficient information whether one of 

1 5 said scene cuts has occurred. 

1 5. The method of claim 1 , wherein said analyzing step comprises parsing said 

2 compressed bitstream into blocks of video information and associated 

3 motion vector information for each field or frame of video information 

4 which comprises the determined sequence of fields or frames of video 

5 information representative of said single scene, and wherein said estimating 

6 step comprises the step of estimating any zoom and any pan of said camera 

7 by determining a multi-parameter transform model applied to said parsed 

8 motion vector information. 

1 6. The method of claim 5, wherein said estimating step comprises the steps of: 

2 a. computing each parameter for a multi-parameter affine transform 

3 which represents a transformation from a current frame of video 

4 information to a previous frame of video; and 

5 b. computing said multi-parameter affine transform to thereby 

6 determine global motion information representative of said zoom 

7 and pan of said camera. 

1 7. The method of claim 6, wherein said detecting step comprises computing 

2 local object motion for said one or more moving video objects based on 

3 said global motion information and on one or more of said motion vectors 

4 which correspond to said one or more moving video objects. 
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1 8. The method of claim 7, further comprising the steps of : 

2 a. determining whether said local object motion is greater than a 

3 predetermined threshold; 

4 b. applying morphological operations to said determined local object 

5 motion values to eliminate any erroneously sensed moving objects; 

6 and 

7 c. determining border points of said detected moving objects to 

8 thereby locate a bounding box for said detected moving object. 

1 9. An apparatus for detecting moving video objects in a compressed digital 

2 bitstream which represents a sequence of fields or frames of video 

3 information for one or more previously captured scenes of video, 

4 comprising: 

5 a. means for analyzing said compressed bitstream to locate scene cuts 

6 therein and to determine at least one sequence of fields or frames of 

7 video information which represents a single video scene; 

8 b. means, coupled to said analyzing means, for estimating one or more 

9 operating parameters for a camera which initially viewed said video 

1 0 scene by analyzing a portion of said compressed bitstream which 

1 1 corresponds to said video scene; and 

1 2 c. means, coupled to said estimating means, for detecting one or more 

1 3 moving video objects represented in said compressed bitstream by 

14 applying global motion compensation with said estimated operating 

15 parameters. 

1 10. The apparatus of claim 9, further comprising means, coupled to said 

2 detecting means, for extracting visual features of said one or more detected 

3 moving video objects from said compressed bitstream. 
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1 11. The apparatus of claim 9, wherein said compressed bitstream comprises a 

2 bitstream compressed in accordance with the MPEG video standard, and 

3 wherein said analyzing means further comprises: 

4 a. parsing means for receiving and parsing said compressed bitstream 

5 into blocks of video information and associated motion vector 

6 information for each field or frame of video information which 

7 comprises the determined sequence of fields or frames of video 

8 information representative of said single scene; 

9 b. means, coupled to said parsing means, for performing inverse 

1 0 motion compensation on each of said parsed blocks of video 

1 1 information to derive discrete cosign transform coefficients for each 

12 of said parsed blocks of video information; 

13 c. counting means, coupled to said inverse motion compensation 

14 means, for counting said motion vector information associated with 

15 each of said parsed blocks of vide information; and 

16 d. determining means, coupled to said counting mens, for determining 

17 from said counted motion vector information and said discrete 

1 8 cosign transform coefficient information whether one of said scene 

19 cuts has occurred. 

1 12. The apparatus of claim 9, wherein said analyzing means further comprises 

2 means for parsing said compressed bitstream into blocks of video 

3 information and associated motion vector information for each field or 

4 frame of video information which comprises the determined sequence of 

5 fields or frames of video information representative of said single scene, 

6 and wherein said estimating means further comprises means for estimating 

7 any zoom and any pan of said camera by determining a multi-parameter 

8 transform model applied to said parsed motion vector information. 
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1 13. The apparatus of claim 12, wherein said estimating means further 

2 comprises: 

3 a. means for computing each parameter for a multi-parameter affine 

4 transform which represents a transformation from a current frame of 

5 video information to a previous frame of video; and 

6 b. means, coupled to said transform parameter computing means, for 

7 computing said multi-parameter affine transform to thereby 

8 determine global motion information representative of said zoom 

9 and pan of said camera. 

1 14. The apparatus of claim 12, wherein said detecting means further comprises 

2 means for computing local object motion for said one or more moving 

3 video objects based on said global motion information and on one or more 

4 of said motion vectors which correspond to said one or more moving video 

5 objects. 



1 15. The apparatus of claim 14, further comprising: 

2 a. comparison means, coupled to said local object motion computing 

3 means, for determining whether said local object motion is greater 

4 than a predetermined threshold; 

5 b. morphological operation means, coupled to said comparison 

6 means, for determined local object motion values to eliminate any 

7 erroneously sensed moving objects; and 

8 c. border point determination means, coupled to said morphological 

9 operation means, for determining border points of said detected 

10 moving objects to thereby locate a bounding box for said detected 

1 1 moving object. 
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1 16. A method for dissolving an incoming scene of video information which 

2 comprises a sequence of fields or frame of compressed video information 

3 and an outgoing scene of video information which comprises a sequence of 

4 fields or frame of compressed video information, comprising the steps of: 

5 a. applying DCT domain motion inverse compensation to obtain DCT 

6 coefficients for all blocks of video information which make up a last 

7 frame of said outgoing video scene; 

8 b. applying DCT domain inverse motion compensation to obtain the 

9 DCT coefficients for all blocks of video information which make up 

1 0 the first frame of said incoming video scene; and 

11 c. creating a first frame in a dissolve region from said DCT 

12 coefficients of said last outgoing frame and said first incoming 

13 frame. 

1 1 7. The method of claim 1 6, further comprising the step of choosing an initial 

2 value for a weighing function prior to step (c). 

1 18. The method of claim 1 7, further comprising the steps of 

2 d. incrementing said weighting function value; and 

3 e. creating a second frame in said dissolve region from said DCT 

4 coefficients of said last outgoing frame and said first incoming 

5 frame using said incremented weighing function value. 

1 19. A method for masking a region of a compressed frame of digital video 

2 information, comprising the steps of : 

3 a. determining whether said frame to be masked is intra-coded, 

4 predictive-coded or bi-directionally predictive-coded; 

5 b. if said frame is intra-coded: 

6 i. extracting DCT coefficients for all blocks within said frame; 

7 ii. examining block,, to determine where in said frame said 

8 block is located; 
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9 iii. setting said DCT coefficients for said block to zero if said 

1 0 block is outside said mask region; 

1 1 iv. applying a DCT cropping algorithm to said DCT 

12 coefficients if said block is on the boundary of said mask 

13 region; and 

14 v. repeating steps (b)(ii) - (b)(iv) for each block in said frame; 

1 5 c. If said frame is predictive-coded or bi-directionally predictive- 

16 coded: 

17 i. examining motion vectors associated with block„ to 

1 8 determine whether they point to blocks outside or on said 

19 mask region; 

20 ii. reencoding said block if a motion vector points to blocks 

2 1 outside or on said mask region; and 

22 iii. repeating steps (c)(i) - (c)(ii) for all blocks in said frame. 

1 20. A method for generating a reduced speed sequence of frames of video 

2 information from a sequence of frames of compressed video information, 

3 comprising the steps of: 

4 a. selecting a frame of compressed video information to be repeated; 

5 b. determining whether said frame to be repeated is intra-coded, 

6 predictive-coded or bi-directionally predictive-coded; 

7 c. converting said frame into an intra-coded frame if said frame is a 

8 predictive-coded or bi-directionally predictive-coded frame; 

9 d. creating duplicate predictive-coded frames; and 

10 e. arranging said determined frame and said duplicate predictive-coded 

1 1 frames into a sequence of compressed frames of video information. 



1 

2 



21. 



The method of claim 20, wherein said reduced speed sequence of frames 
generates a frozen frame effect. 
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1 22. The method of claim 20, further comprising the step of converting said 

2 frame into an intra-coded frame if said frame is a predictive-coded or bi- 

3 directionally predictive-coded frame. 

1 23. A system for editing compressed video information over a distributed 

2 network, comprising: 

3 a. a client computer; 

4 b. a network link, coupled to said client computer, for permitting said 

5 client computer to search for and locate compressed video 

6 information on said distributed network; and 

7 c. means for editing a compressed bitstream of video information over 

8 said distributed network. 

1 24. The system of claim 21, wherein said editing means includes means for 

2 dissolving an incoming scene of video information which includes a 

3 sequence of fields or frame of compressed video information and an 

4 outgoing scene of video information which includes a sequence of fields or 

5 frame of compressed video information, said dissolving means comprising: 

6 a. outgoing motion compensation means for applying DCT domain 

7 motion compensation to a last frame of said outgoing video scene to 

8 obtain DCT coefficients for all blocks of video information which 

9 make up said last frame of said outgoing video scene; 

10 b. incoming motion compensation means, for applying DCT domain 

1 1 motion compensation to a first frame of said incoming video scene 

12 to obtain the DCT coefficients for all blocks of video information 

13 which make up said first frame of said incoming video scene; and 

14 c. dissolve region creating means, coupled to said incoming motion 

15 compensation means and to said outgoing motion compensation 

16 means, for creating a first frame in a dissolve region from said DCT 

17 coefficients of said last outgoing frame and said first incoming 

18 frame. 
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1 25. The system of claim 2 1 , wherein said editing means includes means for 

2 masking a region of a compressed frame of digital video information, said 

3 masking means comprising: 

4 a. means for determining whether said frame to be masked is intra- 

5 coded, predictive-coded or bi-directionally predictive-coded; 

6 b. means for processing intra-coded frames including: 

7 i. means for receiving DCT coefficients and for extracting 

8 DCT coefficients for all blocks within said frame; 

9 ii. means, coupled to said receiving means, for examining 

10 blocks of compressed video information in said frame to 

1 1 determine where in said frame said block is located; 

12 iii. means, coupled to said examining means, for setting said 

1 3 DCT coefficients for said block to zero if said block is 

14 outside said mask region; and 

1 5 iv. means, coupled to said setting means, for applying a DCT 

16 translation algorithm to said DCT coefficients if said block 

17 is on the boundary of said mask region; and 

1 8 c. means for processing predictive-coded and bi-directionally 

1 9 predictive-coded frames, including: 

20 i. means for receiving and examining motion vectors 

21 associated with blocks of video information to determine 

22 whether they point to blocks outside or on said mask region; 

23 and 

24 ii. means, coupled to said receiving means, for reencoding said 

25 block if a motion vector points to blocks outside or on said 

26 mask region. 
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1 26. The system of claim 2 1 , wherein said editing means includes means 

2 generating a reduced speed sequence of frames of video information from a 

3 sequence of frames of compressed video information, comprising: 

4 a. selection means for selecting a frame of compressed video 

5 information to be repeated; 

6 b. computational means, coupled to said selection means, for 

7 determining whether said frame to be repeated is intra-coded, 

8 predictive-coded or bi-directionally predictive-coded; 

9 c. frame generating means, coupled to said computational means and 

10 to said converting means, for creating duplicate predictive-coded 

1 1 frames; and 

12 d. frame arranging means, coupled to said frame generating means, for 

13 arranging said determined frame and said duplicate predictive-coded 

14 frames into a sequence of compressed frames of video information. 

1 27. A method for converting a full resolution compressed domain bitstream 

2 into a reduced resolution compressed bitstream, comprising the steps of: 

3 a. examining a frame of compressed video information from said full 

4 resolution bitstream; 

5 b. determining whether said examined frame is intra-coded, predictive- 

6 coded or bi-directionally predictive-coded; 

7 c. extracting DCT DC coefficients for said determined frame if said 

8 determined frame is intra-coded 

9 d. applying DCT domain inverse motion compensation to said frame 

10 to extract DCT DC coefficients if said frame is predictive-coded or 

1 1 bi-directionally predictive-coded; and 

12 e. converting said extracted DCT DC coefficients into DCT DC 

13 coefficients for a reduced size intra-coded frame of video. 



